Scalable Knowledge Discovery in Complex Data with Pattern Structures

نویسنده

  • Sergei O. Kuznetsov
چکیده

Pattern structures propose a direct way to knowledge discovery in data with structure, such as logical formulas, graphs, strings, tuples of numerical intervals, etc., by defining closed descriptions and discovery tools build upon them: automatic construction of taxonomies, association rules and classifiers. A combination of lazy evaluation with projections of initial data, randomization and parallelization suggest efficient approach which is scalable to big data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fitting Pattern Structures to Knowledge Discovery in Big Data

Pattern structures, an extension of FCA to data with complex descriptions, propose an alternative to conceptual scaling (binarization) by giving direct way to knowledge discovery in complex data such as logical formulas, graphs, strings, tuples of numerical intervals, etc. Whereas the approach to classification with pattern structures based on preceding generation of classifiers can lead to dou...

متن کامل

Visual pattern discovery in image and video data: a brief survey

In image and video data, visual pattern refers to re-occurring composition of visual primitives. Such visual patterns extract the essence of the image and video data that convey rich information. However, unlike frequent patterns in transaction data, there are considerable visual content variations and complex spatial structures among visual primitives, which makes effective exploration of visu...

متن کامل

New Applications of Formal Concept Analysis: A Need for Original Pattern Domains

We survey the results obtained by our research group (joint work with Jérémy Besson and Löıc Cerf, Kim-Ngan T. Nguyen, Marc Plantevit, and Céline Robardet) concerning the design of pattern domains to support knowledge discovery and information retrieval in arbitrary n-ary relations. Our contribution is related to Formal Concept Analysis and its recent developments in direction of, for instance,...

متن کامل

Scalable Link Discovery for Modern Data-Driven Applications

The constant growth of volume and velocity of knowledge bases on the Linked Data Web has led to an increasing need for scalable linking techniques between resources. Modern data-driven applications often have to integrate large amounts of data relaying on fast but accurate Link Discovery solutions. Hence, they often operate under time or space constraints. Additionally, most Link Discovery fram...

متن کامل

AMKIS: An Algorithm for Association Mining

Mining frequent items and itemsets is a daunting task in large databases and has attracted research attention in recent years. Generating specific itemset, K –itemset having K items, is an interesting research problem in data mining and knowledge discovery. In this paper, we propose an algorithm for finding K itemset frequent pattern generation in large databases which is named as AMKIS. AMKIS ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013